SNR-dependent background noise compensation of PESQ values for cellular phone speech
نویسندگان
چکیده
To evaluate the speech quality of actual cellular phone systems with an objective assessment, PESQ values were compared with MOS values for speech with background noises via four cellular phone systems used in Japan. As PESQ value errors were observed to be SNR-dependent, two SNR-dependent background noise compensation methods for PESQ values are proposed. Applying the compensation methods to the speech for four cellular phone systems, the RMSEs between MOS and compensated PESQ values were reduced to less than half of the original RMSEs for all four cellular phone systems. They were equal to the level of RMSE of MOS values.
منابع مشابه
Speech enhancement based on hidden Markov model using sparse code shrinkage
This paper presents a new hidden Markov model-based (HMM-based) speech enhancement framework based on the independent component analysis (ICA). We propose analytical procedures for training clean speech and noise models by the Baum re-estimation algorithm and present a Maximum a posterior (MAP) estimator based on Laplace-Gaussian (for clean speech and noise respectively) combination in the HMM ...
متن کاملSpeech Enhancement Using Gaussian Mixture Models, Explicit Bayesian Estimation and Wiener Filtering
Gaussian Mixture Models (GMMs) of power spectral densities of speech and noise are used with explicit Bayesian estimations in Wiener filtering of noisy speech. No assumption is made on the nature or stationarity of the noise. No voice activity detection (VAD) or any other means is employed to estimate the input SNR. The GMM mean vectors are used to form sets of over-determined system of equatio...
متن کاملBlind source extraction based on a direction-dependent a-priori SNR
In many hands-free applications, we encounter a speaker located in the near-field embedded in diffuse far-field noise. In this paper, we contribute an algorithm to estimate the speech and noise power spectral density (PSD) based on a directiondependent SNR (DD-SNR). The only prior knowledge needed is a model of the diffuse noise sound field. The enhanced speech signal is obtained by a parametri...
متن کاملDistributed multichannel speech enhancement based on perceptually-motivated Bayesian estimators of the spectral amplitude
In this study, the authors propose multichannel weighted Euclidean (WE) and weighted cosh (WCOSH) cost function estimators for speech enhancement in the distributed microphone scenario. The goal of the work is to illustrate the advantages of utilising additional microphones and modified cost functions for improving signal-to-noise ratio (SNR) and segmental SNR (SSNR) along with log-likelihood r...
متن کاملFeature Compensation for Speech Recognition in Severely Adverse Environments Due to Background Noise and Channel Distortion
This paper proposes an effective feature compensation scheme to address severely adverse environments for robust speech recognition, where background noise and channel distortion are simultaneously involved. An iterative channel estimation method is integrated into the framework of our Parallel Combined Gaussian Mixture Model (PCGMM) based feature compensation algorithm [1]. A new speech corpus...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2005